RAW SEQUENCE LISTING
ERROR REPORT



BIOTECHNOLOGY
SYSTEMS
BRANCH




The Biotechnology Systems Branch of the Scientific and Technical Information
Center (STIC) detected errors when processing the following computer readable
form:

Application Serial Number: 09/719,261

Source:

Date Processed by STIC:



THE ATTACHED PRINTOUT EXPLAINS [illegible] BY EITHER:

[illegible] QUESTIONS, PLEASE CONTACT MARK SPENCER, [illegible].

[illegible] TRADEMARK OFFICE WEBSITE. SEE BELOW:



Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows based software program
employing a logical and intuitive user interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR 1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998, as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K-
compliant. Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO).
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence
listings, thus saving time and money.

Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/web/offices/pac/checker



Raw Sequence Listing Error Summary

SERIAL NUMBER:



ERROR DETECTED / SUGGESTED CORRECTION

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

1  Wrapped Nucleics / Wrapped Aminos
   The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
   was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
   prevent "wrapping."

2  Invalid Line Length
   The rules require that a line not exceed 72 characters in length. This includes white spaces.

3  Misaligned Amino Numbering
   The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
   use space characters, instead.

4  Non-ASCII
   The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
   ensure your subsequent submission is saved in ASCII text.

5  Variable Length
   Sequence(s) ____ contain n's or Xaa's representing more than one residue. Per Sequence Rules,
   each n or Xaa can only represent a single residue. Please present the maximum number of each
   residue having variable length and indicate in the <220>-<223> section that some may be missing.

6  PatentIn 2.0 "bug"
   A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
   sequence(s) ____. Normally, PatentIn would automatically generate this section from the
   previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
   the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
   Artificial or Unknown sequences.

7  Skipped Sequences (OLD RULES)
   Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:
   (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
   (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
   This sequence is intentionally skipped
   Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8  Skipped Sequences (NEW RULES)
   Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:
   <210> sequence id number
   <400> sequence id number
   000

9  Use of n's or Xaa's (NEW RULES)
   Use of n's and/or Xaa's have been detected in the Sequence Listing.
   Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
   In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10 Invalid <213> Response
   Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
   scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
   is Artificial Sequence.

11 Use of <220>
   Sequence(s) ____ missing the <220> "Feature" and associated numeric identifiers and responses.
   Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
   "Unknown." Please explain source of genetic material in <220> to <223> section.
   (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12 PatentIn 2.0 "bug"
   Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
   resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
   listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.

13 Misuse of n
   n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent
   any value not specifically a nucleotide.
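
The line-length, tab and ASCII rules in the error summary above lend themselves to a quick pre-submission check. The following is a minimal sketch in Python, not the Checker program itself: the function name `check_listing_lines` and the wording of the messages are hypothetical, and it flags every tab character, which is broader than the summary's rule about tabs between amino acid numbers.

```python
MAX_LINE_LENGTH = 72  # limit quoted in the error summary; white space counts


def check_listing_lines(text):
    """Return (line_number, problem) tuples for a listing passed in as a string.

    Hypothetical helper for illustration only; it is not part of Checker.
    """
    problems = []
    for number, line in enumerate(text.splitlines(), start=1):
        # Invalid line length: a line may not exceed 72 characters.
        if len(line) > MAX_LINE_LENGTH:
            problems.append((number, "line exceeds 72 characters"))
        # Non-ASCII: the file must be saved as plain ASCII text.
        if not line.isascii():
            problems.append((number, "non-ASCII character"))
        # Misaligned numbering is often caused by tabs; spaces are expected.
        if "\t" in line:
            problems.append((number, "tab character"))
    return problems
```

Calling `check_listing_lines(open(path, encoding="latin-1").read())` on a draft file (with `path` supplied by the user) would list the offending line numbers before the file is submitted as a CRF.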

AMC/MH - Biotechnology Systems Branch - 08/21/2001



The type of errors shown exist throughout the Sequence Listing.
Please check subsequent sequences for similar errors.






RAW SEQUENCE LISTING                    DATE: 08/30/2001
PATENT APPLICATION: US/09/719,261       TIME: 09:46:14

Input Set : A:\Mewburn-Matassa ('261) Sequence Listing.txt
Output Set: N:\CRF3\08302001\I719261.raw

4 <110> APPLICANT: Istituto Di Ricerche Di Biologia Molecolare P. Angeletti SpA
5 Matassa, Victor
6 Narjes, Frank
7 [illegible]
8 [illegible]
9 Poma, Marco
11 <120> TITLE OF INVENTION: Peptide inhibitors of hepatitis C virus NS3 protease
13 <130> FILE REFERENCE: KMN/FP5780044
15 <140> CURRENT APPLICATION NUMBER: 09/719,261
C--> 16 <141> CURRENT FILING DATE: 2001-07-23
18 <150> PRIOR APPLICATION NUMBER: PCT/GB99/01824
19 <151> PRIOR FILING DATE: 1999-06-09
21 <150> PRIOR APPLICATION NUMBER: GB 9812523.0
22 <151> PRIOR FILING DATE: 1998-06-10
24 <160> NUMBER OF SEQ ID NOS: 13
26 <170> SOFTWARE: PatentIn Ver. 2.1

29 <210> SEQ ID NO: 1
30 <211> LENGTH: 4
31 <212> TYPE: PRT
32 <213> ORGANISM: Artificial Sequence
34 <220> FEATURE:
35 <221> NAME/KEY: SITE
36 <222> LOCATION: (1)
37 <223> OTHER INFORMATION: Xaa is diphenylalanine
39 <220> FEATURE:
40 <221> NAME/KEY: SITE
41 <222> LOCATION: (3)
42 <223> OTHER INFORMATION: Xaa is cyclohexylalanine
44 <220> FEATURE:
45 <221> NAME/KEY: SITE
46 <222> LOCATION: (4)
47 <223> OTHER INFORMATION: Xaa is 4,4-difluoro-2-amino butyric acid
49 <220> FEATURE:
50 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic
51 sequence
53 <400> SEQUENCE: 1
W--> 54 Xaa Glu Xaa Xaa
55 1

59 <210> SEQ ID NO: 2
60 <211> LENGTH: 6
61 <212> TYPE: PRT
62 <213> ORGANISM: Artificial Sequence
64 <220> FEATURE:
65 <221> NAME/KEY: SITE
66 <222> LOCATION: (6)
67 <223> OTHER INFORMATION: Xaa is 4,4-difluoro-2-amino butyric acid
69 <220> FEATURE:




70 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic
71 sequence
73 <400> SEQUENCE: 2
W--> 74 Asp Glu Met Glu Glu Xaa
75 1 5

78 <210> SEQ ID NO: 3
79 <211> LENGTH: 6
80 <212> TYPE: PRT
81 <213> ORGANISM: Artificial Sequence
83 <220> FEATURE:
84 <221> NAME/KEY: SITE
85 <222> LOCATION: (3)
86 <223> OTHER INFORMATION: Xaa is diphenylalanine
88 <220> FEATURE:
89 <221> NAME/KEY: SITE
90 <222> LOCATION: (5)
91 <223> OTHER INFORMATION: Xaa is cyclohexylalanine
93 <220> FEATURE:
94 <221> NAME/KEY: SITE
95 <222> LOCATION: (6)
96 <223> OTHER INFORMATION: Xaa is 4,4-difluoro-2-amino butyric acid
98 <220> FEATURE:
99 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic
100 sequence
102 <400> SEQUENCE: 3
W--> 103 Asp Glu Xaa Glu Xaa Xaa
104 1 5

107 <210> SEQ ID NO: 4
108 <211> LENGTH: 6
109 <212> TYPE: PRT
110 <213> ORGANISM: Artificial Sequence
112 <220> FEATURE:
113 <221> NAME/KEY: SITE
114 <222> LOCATION: (3)
115 <223> OTHER INFORMATION: Xaa is diphenylalanine
117 <220> FEATURE:
118 <221> NAME/KEY: SITE
119 <222> LOCATION: (5)
120 <223> OTHER INFORMATION: Xaa is cyclohexylalanine
122 <220> FEATURE:
123 <221> NAME/KEY: SITE
124 <222> LOCATION: (6)
125 <223> OTHER INFORMATION: Xaa is a fluorinated hydrocarbon side chain
127 <220> FEATURE:
128 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic
129 sequence
131 <400> SEQUENCE: 4
W--> 132 Asp Glu Xaa Glu Xaa Xaa
133 1 5






136 <210> SEQ ID NO: 5
137 <211> LENGTH: 6
138 <212> TYPE: PRT
139 <213> ORGANISM: Artificial Sequence
141 <220> FEATURE:
142 <221> NAME/KEY: SITE
143 <222> LOCATION: (6)
144 <223> OTHER INFORMATION: Xaa is a fluorinated hydrocarbon side chain
146 <220> FEATURE:
147 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic
148 sequence
150 <400> SEQUENCE: 5
W--> 151 Asp Glu Met Glu Glu Xaa
152 1 5

155 <210> SEQ ID NO: 6
156 <211> LENGTH: 5
157 <212> TYPE: PRT
158 <213> ORGANISM: Artificial Sequence
160 <220> FEATURE:
161 <221> NAME/KEY: SITE
162 <222> LOCATION: (1)
163 <223> OTHER INFORMATION: Asp as tertiary butyl ester
165 <220> FEATURE:
166 <221> NAME/KEY: SITE
167 <222> LOCATION: (2, 4, 5)
168 <223> OTHER INFORMATION: Glu as tertiary butyl ester
170 <220> FEATURE:
171 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic
172 sequence
174 <400> SEQUENCE: 6
175 Asp Glu Met Glu Glu
176 1 5

179 <210> SEQ ID NO: 7
180 <211> LENGTH: 5
181 <212> TYPE: PRT
182 <213> ORGANISM: Artificial Sequence
184 <220> FEATURE:
185 <221> NAME/KEY: SITE
186 <222> LOCATION: (1)
187 <223> OTHER INFORMATION: Asp as tertiary butyl ester
190 <220> FEATURE:
191 <221> NAME/KEY: SITE
192 <222> LOCATION: (2, 4)
193 <223> OTHER INFORMATION: Glu as tertiary butyl ester
195 <220> FEATURE:
196 <221> NAME/KEY: SITE
197 <222> LOCATION: (3)
198 <223> OTHER INFORMATION: Xaa is diphenylalanine
200 <220> FEATURE:



The type of errors shown exist throughout
the Sequence Listing. Please check subsequent
sequences for similar errors.



201 <221> NAME/KEY: SITE
202 <222> LOCATION: (5)
203 <223> OTHER INFORMATION: Xaa is cyclohexylalanine
205 <220> FEATURE:
206 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic
207 sequence
209 <400> SEQUENCE: 7
W--> 210 Asp Glu Xaa Glu Xaa
211 1 5

214 <210> SEQ ID NO: 8
215 <211> LENGTH: 17
216 <212> TYPE: PRT
217 <213> ORGANISM: Artificial Sequence
219 <220> FEATURE:
220 <221> NAME/KEY: MOD_RES
221 <222> LOCATION: (17)
222 <223> OTHER INFORMATION: AMIDATION
224 <220> FEATURE:
225 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic
226 sequence
228 <400> SEQUENCE: 8
229 Lys Lys Lys Gly Ser Val Val Ile Val Gly Arg Ile Ile Leu Ser Gly
230 1 5 10 15
232 Arg

235 <210> SEQ ID NO: 9
236 <211> LENGTH: 13
237 <212> TYPE: PRT
238 <213> ORGANISM: Artificial Sequence
240 <220> FEATURE:
241 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic
242 sequence
244 <400> SEQUENCE: 9
245 Asp Glu Met Glu Glu Cys Ala Ser His Leu Pro Tyr Lys
246 1 5 10

249 <210> SEQ ID NO: 10
250 <211> LENGTH: 4
251 <212> TYPE: PRT
252 <213> ORGANISM: Artificial Sequence
254 <220> FEATURE:
255 <221> NAME/KEY: SITE
256 <222> LOCATION: (1)..(3)
257 <223> OTHER INFORMATION: Phenyalanines are linked by an ether bond
259 <220> FEATURE:
260 <221> NAME/KEY: SITE
261 <222> LOCATION: (4)
262 <223> OTHER INFORMATION: Xaa is 4,4-difluoro-2-amino butyric acid
264 <220> FEATURE:
265 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic
266 sequence

268 <400> SEQUENCE: 10
W--> 269 Phe Glu Phe Xaa
270 1

273 <210> SEQ ID NO: 11
274 <211> LENGTH: 6
275 <212> TYPE: PRT
276 <213> ORGANISM: Artificial Sequence
278 <220> FEATURE:
279 <221> NAME/KEY: SITE
280 <222> LOCATION: (3)
281 <223> OTHER INFORMATION: Xaa is diphenylalanine
283 <220> FEATURE:
284 <221> NAME/KEY: SITE
285 <222> LOCATION: (5)
286 <223> OTHER INFORMATION: Xaa is cyclohexylalanine
288 <220> FEATURE:
289 <221> NAME/KEY: SITE
290 <222> LOCATION: (6)
291 <223> OTHER INFORMATION: Xaa is 3-amino-5,5-difluoro-pentanoic acid
293 <220> FEATURE:
294 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic
295 sequence
297 <400> SEQUENCE: 11
W--> 298 Asp Glu Xaa Glu Xaa Xaa
299 1 5

302 <210> SEQ ID NO: 12
303 <211> LENGTH: 5
304 <212> TYPE: PRT
305 <213> ORGANISM: Artificial Sequence
307 <220> FEATURE:
308 <221> NAME/KEY: SITE
309 <222> LOCATION: (2)
310 <223> OTHER INFORMATION: Xaa is diphenylalanine
312 <220> FEATURE:
313 <221> NAME/KEY: SITE
314 <222> LOCATION: (4)
315 <223> OTHER INFORMATION: Xaa is cyclohexylalanine
317 <220> FEATURE:
318 <221> NAME/KEY: SITE
319 <222> LOCATION: (5)
320 <223> OTHER INFORMATION: Xaa is 4,4-difluoro-2-amino butyric acid
322 <220> FEATURE:
323 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic
324 sequence
326 <400> SEQUENCE: 12
W--> 327 Glu Xaa Ile Xaa Xaa
328 1 5

331 <210> SEQ ID NO: 13
332 <211> LENGTH: 6
VERIFICATION SUMMARY                    DATE: 08/30/2001
PATENT APPLICATION: US/09/719,261       TIME: 09:46:15

Input Set : A:\Mewburn-Matassa ('261) Sequence Listing.txt
Output Set: N:\CRF3\08302001\I719261.raw

L:16 M:271 C: Current Filing Date differs, Replaced Current Filing Date
L:54 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1
L:74 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:2
L:103 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:3
L:132 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4
L:151 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:5
L:210 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:269 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:10
L:298 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:11
L:327 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:12
L:356 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13
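
The warnings in the verification summary concern sequences that use "Xaa", which under the new rules must be explained in a <220> to <223> section. The sketch below shows one way an applicant might screen a new-rules listing for this before filing. It is an illustration under stated assumptions, not the logic of the USPTO Checker: the function name `xaa_without_explanation` is hypothetical, it only looks at "Xaa" (not "n"), and it treats any <223> line that mentions Xaa as an explanation, which is a heuristic.

```python
import re


def xaa_without_explanation(listing):
    """Return SEQ ID numbers whose <400> block uses Xaa with no <223> line
    mentioning Xaa. Hypothetical helper; the matching rule is a heuristic."""
    flagged = []
    # Each chunk is the text that follows one <210> identifier.
    for chunk in re.split(r"<210>", listing)[1:]:
        seq_id = re.search(r"\d+", chunk)
        if seq_id is None or "<400>" not in chunk:
            continue
        # Everything before <400> is the feature section; after it, residues.
        features, residues = chunk.split("<400>", 1)
        uses_xaa = re.search(r"\bXaa\b", residues, re.IGNORECASE) is not None
        explained = (
            re.search(r"<223>[^\n]*\bXaa\b", features, re.IGNORECASE) is not None
        )
        if uses_xaa and not explained:
            flagged.append(int(seq_id.group()))
    return flagged
```

An empty result does not mean the listing is compliant; it only means every sequence using Xaa has at least one <223> line that refers to it.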